An Algebraic Topological Approach to Privacy: Numerical and Categorical Data
Abstract
In this paper, we cast the classic problem of achieving k-anonymity for a given database as a problem in algebraic topology. Using techniques from this field, we propose a framework for k-anonymity that brings new insights and algorithms for anonymizing a database. We begin with the simpler case in which the data lie in a metric space; this case serves to introduce the main ideas and notation. Specifically, by mapping a database to Euclidean space and considering the distances between data points, we introduce a simplicial representation of the data and show how concepts from algebraic topology, such as the nerve complex and persistent homology, can be applied to efficiently obtain the entire spectrum of k-anonymity of the database for various values of k and levels of generalization. For this representation, we provide an analytic characterization of the conditions under which a given representation of the dataset is k-anonymous. We introduce a weighted barcode diagram which, in this context, becomes a computational tool for trading off data anonymity against data loss, expressed as the level of generalization. Simulation results illustrate the main idea of the paper. We conclude with a discussion of how to extend this method to the general case of mixed categorical and metric data.
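The idea of reading off k-anonymity across levels of generalization can be illustrated with a much simpler proxy than the nerve-complex construction the abstract describes. The sketch below is not the paper's algorithm: it assumes that generalizing a record to level r makes it indistinguishable from every record within Euclidean distance r, and it reports the largest k for which every record has at least k such records (itself included). The function names and the sample points are hypothetical.

```python
import math
from itertools import combinations


def neighbor_counts(points, r):
    """For each point, count the records (itself included) within distance r."""
    counts = [1] * len(points)  # every record is within distance 0 of itself
    for i, j in combinations(range(len(points)), 2):
        if math.dist(points[i], points[j]) <= r:
            counts[i] += 1
            counts[j] += 1
    return counts


def anonymity_level(points, r):
    """Largest k such that every record has at least k records within r.

    Assumption: distance <= r stands in for 'indistinguishable after
    generalization'; the paper's simplicial criterion may differ.
    """
    return min(neighbor_counts(points, r))


def anonymity_spectrum(points, radii):
    """Pair each generalization level r with the k it achieves."""
    return [(r, anonymity_level(points, r)) for r in radii]


# Hypothetical toy database: two small clusters in the plane.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0)]
spectrum = anonymity_spectrum(points, [0.5, 1.0, 1.5, 8.0])
for r, k in spectrum:
    print(f"r = {r}: {k}-anonymous under the neighbor-count proxy")
```

Under this proxy the achieved k never decreases as r grows, which is the same trade-off between anonymity and generalization that the weighted barcode diagram is meant to expose.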
Similar resources
Functorial semantics of topological theories
Following the categorical approach to universal algebra through algebraic theories, proposed by F. W. Lawvere in his PhD thesis, this paper aims at introducing a similar setting for general topology. The cornerstone of the new framework is the notion of categorically-algebraic (catalg) topological theory, whose models induce a category of topological structures. We introduce t...
On algebraic and coalgebraic categories of variety-based topological systems
Motivated by the recent study on categorical properties of lattice-valued topology, the paper considers a generalization of the notion of topological system introduced by S. Vickers, providing an algebraic and a coalgebraic category of the new structures. As a result, the nature of the category TopSys of S. Vickers gets clarified, and a metatheorem is stated, claiming that (latticevalu...
Categorically-algebraic topology and its applications
This paper introduces a new approach to topology, based on category theory and universal algebra, and called categorically-algebraic (catalg) topology. It incorporates the most important settings of lattice-valued topology, including poslat topology of S. E. Rodabaugh, $(L,M)$-fuzzy topology of T. Kubiak and A. Šostak, and $M$-fuzzy topology on $L$-fuzzy sets of C. Guido. Moreover, its respe...
Bernoulli collocation method with residual correction for solving integral-algebraic equations
The principal aim of this paper is to provide a numerical solution of an integral-algebraic equation (IAE) using the Bernoulli polynomials and the residual correction method. After implementation of our scheme, the main problem is transformed into a system of algebraic equations whose solutions are the unknown Bernoulli coefficients. This method gives an analytic solution when ...
Preserving Micro Data Release: Categorical and Numerical Data
Data mining techniques, in spite of their benefit in a wide range of applications, have also raised threats to privacy and data security. All the attributes in a database table can be classified into three categories: identifying attributes, sensitive attributes and quasi-identifier attributes. K-anonymity is a popular approach for privacy-preserving data mining and the problems with Kanonymi...
Journal:
- CoRR
Volume abs/1602.06643
Pages -
Publication date 2016